๐Ÿฟ๏ธ ScourBrowse
LoginSign Up
You are offline. Trying to reconnect...
Copied to clipboard
Unable to share or copy to clipboard
๐ŸŽฏ Vector Quantization

Product Quantization, Embedding Compression, Memory Efficiency, Approximate Search

Scaling Laws for LLM Based Data Compression
lesswrong.comยท11h
๐Ÿ“Text Compression
Trainable Dynamic Mask Sparse Attention
arxiv.orgยท14h
๐Ÿง LLM Inference
NeuralMorse โ€“ Reinventing Morse Code with Neural Networks
masatohagiwara.netยท27mยท
Discuss: Hacker News
๐Ÿ“Text Compression
Principal Component Analysis (PCA) is the gold standard in dimensionality reduction.
threadreaderapp.comยท3h
๐Ÿ“ŠVector Clustering
LeetCode #70: Climbing Stairs
anmoltomer.bearblog.devยท14h
๐ŸงฎSMT Solvers
Real-time neural video codec โ€“ 100 FPS 1080p and 4K videos
github.comยท12hยท
Discuss: Hacker News
๐Ÿ”ฌRaBitQ
Welcome GPT OSS, the new open-source model family from OpenAI!
huggingface.coยท18h
๐Ÿ“ฑEdge AI Optimization
SAT Requires Exhaustive Search
link.springer.comยท22hยท
Discuss: Hacker News
๐ŸงฎSMT Solvers
Beyond Manually Designed Pruning Policies with Second-Level Performance Prediction: A Pruning Framework for LLMs
arxiv.orgยท14h
๐Ÿง LLM Inference
Context Guided Transformer Entropy Modeling for Video Compression
arxiv.orgยท14h
๐Ÿ“ŠEmbeddings
Open Sourced: ML Interview Questions and Job List (Ranked by Comp and Culture)
github.comยท1hยท
Discuss: Hacker News
๐Ÿง LLM Inference
E-VRAG: Enhancing Long Video Understanding with Resource-Efficient Retrieval Augmented Generation
arxiv.orgยท14h
๐Ÿ“ŠEmbeddings
Information Rates of Approximate Message Passing for Bandlimited Direct-Detection Channels
arxiv.orgยท14h
โ„น๏ธInformation Theory
Filtering with Self-Attention and Storing with MLP: One-Layer Transformers Can Provably Acquire and Extract Knowledge
arxiv.orgยท14h
๐Ÿง LLM Inference
Kernel-Based Sparse Additive Nonlinear Model Structure Detection through a Linearization Approach
arxiv.orgยท14h
๐Ÿง LLM Inference
Attention was never enough: Tracing the rise of hybrid LLMs
ai21.comยท6hยท
Discuss: Hacker News
๐Ÿง LLM Inference
Simple Methods Defend RAG Systems Well Against Real-World Attacks
arxiv.orgยท14h
๐Ÿ’พPersistence Strategies
Accelerating multiparametric quantitative MRI using self-supervised scan-specific implicit neural representation with model reinforcement
arxiv.orgยท14h
๐Ÿ“ŠEmbeddings
Hessian analysis with JAX: a platform-agnostic, high-performance approach
lesswrong.comยท13h
๐Ÿ•ฏ๏ธCandle
Dataset Condensation with Color Compensation
arxiv.orgยท14h
๐Ÿ“ŠEmbeddings
Loading...Loading more...
AboutBlogChangelogRoadmap